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DETAILED ACTION 
Remarks 

1 . In response to the interview of 21 August 2007, claims 1 , 9, 22, 23 are amended, 
claims 4, 8, 10, 14-21 are cancelled, and claims 24-31 are added per applicant's 
request. Claims 1-3, 5-7, 9, 11-13, 22-31 are pending in the application. 

EXAMINER'S AMENDMENT 

2. An examiner's amendment to the record appears below. Should the changes 
and/or additions be unacceptable to applicant, an amendment may be filed as provided 
by 37 CFR 1 .312. To ensure consideration of such an amendment, it MUST be 
submitted no later than the payment of the issue fee. 

Authorization for this examiner's amendment was given in a telephone interview with 
Joseph Juliano on 29 August 2007. 

3. In Claims: 

1 . (Currently Amended) A computer-implemented method of executing a query on a 
data repository, the method comprising: 

receiving a query for execution on data in the data repository; 
generating an estimate of a number of results of the query in accordance with the 
follow i ng estimation functions: 
est(NOT) =N-est(op) x 
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est{OR) = es t (o Pl ) + est(op 2 )- eSt(op ^ eStioP2) x 

N 

where op is an operand, esfl) returns an estimate of a number of results 

matching the an operator or the operand in parenthesis, and N is the total 

number of possible results ; 

defining a subset of data in the data repository; 

determining whether to execute the query on the subset of the data, the query to 
be executed on the subset of the data if the estimate of the number of results of the 
query indicates that executing the query on the subset of the data is estimated to 
return a number of results greater than a threshold number; 

if tho quory i s to bo oxecutod on tho subset of tho data, ex e cut i ng tho quory on 
tho subset of tho data to gonorato a part i a l set of r e su l ts, otherw i se oxocut i ng the 
qu e ry on th e data r e pository to g e n e rat e a compl e t e s e t of r e su l ts; and 

executing the query on the subset of the data to generate a partial set of results 
based on the determining whether to execute the guerv on the subset of the data 
resulting in a determination to execute the guerv on the subset of a data; otherwise 
executing the guerv on the data repository to generate a complete set of results 
based on the determining ; and 

providing qu e ry r e sults the partial set of results of the execution of the guerv or 
the complete set of results of the execution of the guerv . 

2. (Currently Amended) [[A]] The method in accordance w i th otclaim 1 , wherein 
providing query results comprises making the query results available to an application 
program. 

3. (Currently Amended) [[A]] The method in accordanc e with of claim 2, further 
comprising: 

the application program providing query results to a user interface. 



4. (Canceled) 
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5. (Currently Amended) [[A]] The method in accordanc e with of claim 1 , wherein 
determining whether to execute the query on the subset of the data comprises 
estimating whether executing the query on the subset of the data would generate a 
desired number of results, the method further comprising: 

receiving a value representing the desired number of results. 

6. (Currently Amended) [[A]] The method i n accordanco w i th of claim 1 , wherein: 

the method further comprises receiving a value representing a desired number of 
results; 

the query is to be executed on the subset of the data if the estimate of the 
number of results of the query is greater than a weighted subset estimate generated 
in accordance with estimation function: 

fl* — — ♦F, where R is the number of results desired, N is the total number 

stripeSize 

of possible results, F is a safety factor , and stripeSize is the size of the subset of the 
data; and 

determining whether to execute the query on the subset of the data comprises: 
generating the weighted subset estimate; and 

determining whether the estimate of the number of results of the query is 
greater than the weighted subset estimate. 

7. (Currently Amended) [[A]] Jhe method in accordanc o w i th of claim 1 further 
comprising: 

in response to executing the query on a previous subset of the data, determining 
whether a sufficient number of results have been generated; and 

if a sufficient number of results have been generated, defining a next subset of 
the data in the data repository and executing the query on the next subset of the 
data, otherwise executing the query on the data repository. 
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8. (Canceled) 

9. (Currently Amended) An information management system including a processor , the 
system comprising: 

a data repository, wherein the data repository is configured to store data; and 
one or more processes for executing queries on the data repository, wherein the 
one or more processes are operative to: 

receive a query for execution on data in the data repository; 
generate an estimate of a number of results of the query in accordance with 
tho fol l owing estimation functions: 
est(NOT) =N-est(pp) x 



est(AND) = 



est(pp x )* est(op 2 ) 
N 



. and 



est(OR) = est(op } ) + est(op 2 ) — 



est(op t )* est(op 2 ) 
N 
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where op is an operand, esK) returns an estimate of a number of results 
matching toe an operator or operand in parenthesis, and N is the total number 
of possible results : 

define a subset of data in the data repository; 

determine whether to execute the query on the subset of the data, the query to 
be executed on the subset of the data if the estimate of the number of results of 
the query indicates that executing the query on the subset of the data is estimated 
to return a number of results greater than a threshold number; 

execute the query on the subset of the data to generate a partial set of results 
based on the determining whether to execute the guery on the subset of the data 
resulting in a determination to execute the guery on the subset of a data; 
otherwise executing the guery on the data repository to generate a complete set 
of results based on the determining ; and 

if tho quory is to bo oxocuted on the subset of the data, oxocuto the query on 
th o subs e t of th o data to gonorato a part i a l s e t of resu l ts, otherw i s e e xocuto tho 
quory on th e data repos i tory to g e n e rate a comp l oto set of results; and 

provide auorv results the partial set of results of the execution of the guery or 
the complete set of results of the execution of the guery . 

10. (Canceled) 

1 1 . (Currently Amended) Ab The information management system i n accordance with 
of_claim 9, wherein the operation of providing query results comprises making the query 
results available to an application program. 

12. (Currently Amended) An The information management system i n accordanc e w i th 
of_claim 9, wherein the operation of determining whether to execute the query on the 
subset of the data comprises estimating whether executing the query on the subset of 
the data would generate a desired number of results, the one or more processes are 
further operative to: 
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receive a value representing the desired number of results. 

13. (Currently Amended) An The information management system i n accordanc e w i th 
of_claim 9, wherein the one or more processes are further operative to: 

in response to executing the query on a previous subset of the data, determine 
whether a sufficient number of results have been generated; and 

if a sufficient number of results have been generated, define a next subset of the 
data in the data repository and execute the query on the next subset of the data, 
otherwise execute the query on the data repository. 

14. (Canceled) 

15. (Canceled) 

16. (Canceled) 

1 7. (Canceled) 

18. (Canceled) 

19. (Canceled) 

20. (Canceled) 

21. (Canceled) 

22. (Currently Amended) A computer program product, tangibly e mbod i od on an 
i nformat i on carr ie r stored on a computer-readable storage medium , the computer 
program product comprising instructions operable to cause data processing apparatus 
to: 

generate a weighted subset estimate of performing a query on a data repository 
in accordance with tho fo ll ow i ng estimation function: 

N 

R * * F , where R is a number of results desired, N is the total number of 

stripeSize 

possible results, F is a safety factor , and stripeSize is the size of the subset of the 
data; 
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determine to execute the query on a subset of data in the data repository if the 
weighted subset estimate is greater than an estimate of the number of results of the 
query; 

determine to execute the query on the data repository if the estimate of the 
number of results of the query is greater than the weighted subset estimate; 

execute the query on the subset of data in the data repository to generate a 
partial set of results based on the determining to execute the query on the subset of 
data in the data repository or execute the Query on the data repository to generate a 
complete set of results based on the determining to execute the query on the data 
repository ; and 

provide qu e ry r e su l ts the partial set of results of the execution of the query or the 
complete set of results of the execution of the query . 

23. (Currently Amended) A computer program product, tang i b l y ombodiod on an 
information carr ie r stored on a computer-readable storage medium , the computer 
program product comprising instructions operable to cause data processing apparatus 
to: 

generate an estimate of a number of results of a query in accordance with the 
fo l low i ng estimation functions: 

est(NOT) =N-est(op), 

, „^ est(op,)*est(op 2 ) . 
est(AND) = — N » and 

, , est(op,)*est(op 7 ) 
est (OR) = est(op x ) + est(op 2 ) ' 

where op is an operand, est{) returns an estimate of a number of results 
matching toe an operator or the operand in parenthesis, and N is the total 
number of possible results; 

determine to execute the query on a subset of data in the data repository if a 
weighted subset estimate is greater than the estimate of the number of results of the 
query; 
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determine to execute the query on the data repository if the estimate of the 
number of results of the query is greater than the weighted subset estimate; 

execute the query on the subset of data in the data repository to generate a 
partial set of results based on the determining to execute the query on the subset of 
data in the data repository or execute the guery on the data repository to generate a 
complete set of results based on the determining to execute the guerv on the data 
repository : and 

provide qu e ry r e sults the partial set of results of the execution of the guerv or the 
complete set of results of the execution of the guerv . 

24. (New) [[A]] The computer program product i n accordanoo with of claim 22, wherein 
the instructions are further operative to: 

in response to executing the query on a previous subset of the data, determine 
whether a sufficient number of results have been generated; and 

if a sufficient number of results have been generated, define a next subset of the 
data in the data repository and executing the query on the next subset of the data, 
otherwise executing the query on the data repository. 

25. (New) [[A]] The computer program product i n accordance w i th of claim 22, wherein 
the estimate of the number of results of the query is generated in accordance with 
estimation functions: 

est(NOT) =N-est(op), 

«^) = CT ^ ) ^ (qp » ) ,and 

est(OR) = „ f( ^) + „r(c ft )-5^a^2&) , 

where op is an operand, est{) returns an estimate of a number of results 
matching the an operator or the operand in the parenthesis, and N is the total 
number of possible results. 
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26. (New) [[A]] The computer program product in accordanco w i th of claim 23, wherein 
the instructions are further operable to cause the data processing apparatus to: 

in response to executing the query on a previous subset of the data, determine 
whether a sufficient number of results have been generated; and 

if a sufficient number of results have been generated, define a next subset of the 
data in the data repository and executing the query on the next subset of the data, 
otherwise executing the query on the data repository. 



27. (New) [[An]] The information management system in accordanco w i th of claim 9, 
wherein: 

the query is to be executed on the subset of the data if the estimate of the 
number of results of the query is greater than a weighted subset estimate generated 
in accordance with estimation function: 

N 

R * * F , where R is a number of results desired, N is the total number of 

stripeSize 

possible results, F is a safety factor , and stripeSize is the size of the subset of the 
data; and 

determining whether to execute the query on the subset of the data comprises: 
generating the weighted subset estimate; and 

determining whether the estimate of the number of results of the query is 
greater than the weighted subset estimate. 
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28. (New) An information management system including a processor , the system 
comprising: 

a data repository, wherein the data repository is configured to store data; and 
one or more processes for executing queries on the data repository, wherein the 
one or more processes are operative to: 

generate a weighted subset estimate of performing a query on the data 
repository in accordance with tho fol l ow i ng estimation function: 
N 

R* *F, where R is a number of results desired, N is the total 

stripeSize 

number of possible results, F is a safety factor, and stripeSize is the size of a 
subset of the data; 

determine whether to execute the query on the subset of the data in the data 
repository, the query to be executed on the subset of the data in the data 
repository if an estimate of a number of results of the query is greater than the 
weighted subset estimate; 

execute the query on the subset of data in the data repository to generate a 
partial set of results based on the determining whether to execute the query on 
the subset of the data in the data repository resulting in a determination to 
execute the query on the subset of the data; otherwise, executing the query on 
the data repository to generate a complete set of results based on the 
determining ;' and 

provide query r e su l ts the partial set of results of the execution of the guerv or 
the complete set of results of the execution of the guerv 

29. (New) [[An]] The information management system i n accordanco w i th of claim 28, 
wherein the one or more processes are further operative to: 

in response to executing the query on a previous subset of the data, determine 
whether a sufficient number of results have been generated; and 
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if a sufficient number of results have been generated, define a next subset of the 
data in the data repository and execute the query on the next subset of the data, 
otherwise execute the query on the data repository. 

30. (New) A computer implemented method comprising: 

generating a weighted subset estimate of performing a query on a data 
repository in accordance with the-estimation function: 

N 

R * * F , where R is a number of results desired, N is the total 

stripeSize 

number of possible results, F is a safety factor, and stripeSize is the size of a 
subset of data in the data repository; 

determining whether to execute the query on the subset of the data in the data 
repository, the query to be executed on the subset of the data in the data repository 
if an estimate of a number of results of the query is greater than the weighted subset 
estimate; 

executing the query on the subset of data in the data repository to generate a 
partial set of results based on the determining whether to execute the query on the 
subset of the data in the data repository resulting in a determination to execute the 
query on the subset of the data; otherwise, executing the query on the data 
repository to generate a complete set of results based on the determining : and 

providing quory resu l ts the partial set of results of the execution of the guerv or 
the complete set of results of the execution of the query 

31 . (New) [[A]] The method of claim 30 further comprising: 

in response to executing the query on a previous subset of the data, determining 
whether a sufficient number of results have been generated; and 

if a sufficient number of results have been generated, defining a next subset of 
the data in the data repository and executing the query on the next subset of the 
data, otherwise executing the query on the data repository. 
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Allowable Subject Matter 

4. Claims 1-3, 5-7, 9, 11-13, and 22-31. 

The following is an examiner's statement of reasons for allowance: 
The prior art of record, alone or in combination, does not teach or fairly suggest 
the subject matter of independent claims 22 and 23, wherein the limitations include a 
series of formulas for estimating the number of results that a query will return. The 
subject matter of claim 22 teaches wherein the estimate of the number of results of a 

N 

query can be determined by the formula of R * * F , where R is a number of 

stripeSize 

results desired, N is the total number of possible results, F is a safety factor, and 
stripeSize is the size of a subset of data in the data repository. The subject matter of 
claim 23 teaches a series of formulas for determining the estimate of the number of 
results of a query by using the Boolean operators of the query, notably NOT, AND, and 
OR. Specifically, the prior art of record does not teach or suggest 
est(NOT) =N-est(op), 

est(AND) = ^° P ^ N eSt ^\ a^ 

est(OR) = «/(o^) + «Kop 2 )- CT/( ^' ) ^ (qP2) , 

where op is an operand, est{) returns an estimate of a number of results 
matching the operator or operand in the parenthesis, and N is the total number of 
possible results. 
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Any comments considered necessary by applicant must be submitted no later 
than the payment of the issue fee and, to avoid processing delays, should preferably 
accompany the issue fee. Such submissions should be clearly labeled "Comments on 
Statement of Reasons for Allowance." 



Conclusion 

5. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Charles D. Adams whose telephone number is (571) 
272-3938. The examiner can normally be reached on 8:30 AM - 5:00 PM, M - F. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Charles Rones can be reached on (571) 272-4085. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



Charles Adams 
AU2164 
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